arXiv:1509.07928v2 [cs.IT] 20 Apr 2016 


TO APPEAR IN IEEE TRANSACTIONS ON COMMUNICATIONS 


I 


Quantized Massive MU-MIMO-OFDM Uplink 

Christoph Studer, Senior Member, IEEE, and Giuseppe Durisi, Senior Member, IEEE 


Abstract —Coarse quantization at the base station (BS) of a 
massive multi-user (MU) multiple-input multiple-output (MIMO) 
wireless system promises significant power and cost savings. 
Coarse quantization also enables significant reductions of the raw 
analog-to-digital converter (ADC) data that must be transferred 
from a spatially-separated antenna array to the baseband pro¬ 
cessing unit. The theoretical limits as well as practical transceiver 
algorithms for such quantized MU-MIMO systems operating 
over frequency-flat, narrowband channels have been studied 
extensively. However, the practically relevant scenario where 
such communication systems operate over frequency-selective, 
wideband channels is less well understood. This paper investigates 
the uplink performance of a quantized massive MU-MIMO 
system that deploys orthogonal freqnency-division multiplexing 
(OFDM) for wideband communication. We propose new al¬ 
gorithms for quantized maximum a-posteriori (MAP) channel 
estimation and data detection, and we study the associated 
performance/quantization trade-offs. Our results demonstrate 
that coarse qnantization (e.g., four to six bits, depending on the 
ratio between the nnmber of BS antennas and the number of 
users) in massive MU-MIMO-OFDM systems entails virtnally 
no performance loss compared to the infinite-precision case at 
no additional cost in terms of baseband processing complexity. 

Index Terms —Analog-to-digital conversion, convex optimiza¬ 
tion, forward-backward splitting (FBS), frequency-selective chan¬ 
nels, massive mnlti-user multiple-input multiple-output (MU- 
MIMO), maximum a-posteriori (MAP) channel estimation, min¬ 
imum mean-square error (MMSE) data detection, orthogonal 
frequency-division multiplexing (OFDM), qnantization. 


I. Introduction 

We investigate the effects of coarse quantization at the 
base-station (BS) on the performance of uplink data trans¬ 
mission in a massive multi-user (MU) multiple-input multiple- 
output (MIMO) system. To enable reliable communication 
over frequency-selective channels, we develop new methods 
for (optimal) channel estimation and data detection for the 
case when massive MU-MIMO is combined with orthogonal 
frequency-division multiplexing (OFDM). Fig. [T] illustrates 
the massive MU-MIMO-OFDM uplink transmission system 
studied in this paper. We consider a coded massive MU-MIMO 
system where U independent single-antenna user terminals 
communicate with a BS equipped with B ^ U antennas. Our 
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model includes coarse quantization in the analog-to-digital 
converters (ADCs) at the in-phase and quadrature outputs of 
each radio-frequency (RF) chain at the BS side. The square- 
dashed box at the BS side encompasses the proposed channel- 
estimation and data-detection algorithms, which—as we shall 
see—enable reliable wideband data transmission, even when the 
ADCs have low precision (e.g., four to six quantization bits). 

A. Benefits of Quantized Massive MIMO 

In conventional multiple-antenna BSs, each RF port is 
connected to two high-precision ADCs (typically, the in-phase 
and the quadramre signal components are quantized with 
resolutions exceeding 10 bit). Scaling such architectures to 
massive MIMO with hundreds or thousands active antenna 
elements would result in prohibitively high power consumption 
and hardware costs, especially for systems operating in the 
millimeter-wave frequency bands. Since the power consumption 
of an ADC scales roughly exponentially in the number of 
quantization bits ||TJ, reducing the precision of the ADCs 
provides an effective means to accommodate a massive array of 
active antenna elements, while—at the same time—^keeping the 
power consumption and system costs within desirable limits. 
Furthermore, the use of low-precision ADCs allows for a 
relaxation of the quality requirements on the RF circuitry (e.g., 
low-noise amplifiers, oscillators, and mixers), which enables a 
further reduction in both the power consumption and system 
costs. The underlying reason is that RF circuitry needs to 
operate at precision levels “just above” the quantization noise 
floor. Apart from power and cost reductions, coarsely quantized 
massive MIMO also reduces the amount of bits per second 
that needs to be transferred from the antenna unit (usually 
located on top of the cell tower) to the baseband processing 
unit (usually located at the bottom of the cell tower). The 
deployment of low-precision ADCs at the BS mitigates this 
data-transfer bottleneck. 

B. Relevant Prior Art 

Massive MU-MIMO promises to increase the spectral 
efficiency at reduced signal-processing complexity compared 
to that of conventional, small-scale MIMO systems, and is 
therefore considered to be a key technology for next-generation 
wireless systems 0 ^ 0 - Most theoretical results on the 
performance of the massive MU-MIMO uplink are for the 
scenario where the base-station is equipped with infinite- 
precision ADCs. The performance impact of low-precision 
ADCs has been analyzed only recently. 

Bjomsson et al. 0 modeled the aggregate effect of various 
residual hardware impairments (including quantization noise) 
as additive Gaussian noise that is independent of the transmit 
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Fig. 1. Overview of the considered quantized massive MU-MIMO-OFDM uplink system. Left: U single-antenna user terminals; right: massive MIMO 
base station with B U antennas and coarse quantization in the analog-to-digital converters (ADCs). The proposed system includes OFDM to simplify 
communication over frequency-selective, wideband channels. 


signal. Based on this model, it was concluded that massive MU- 
MIMO exhibits a certain degree of robustness against hardware 
impairments. While the underlying Gaussian impairment model 
was demonstrated to be accurate for a diverse set of residual 
RF impairments Q, Q, this model is a poor match for coarse 
quantization, which causes deterministic distortions that are— 
by nature—dependent on the transmit signal. 

Risi et al. Q analyzed the performance of a massive MU- 
MIMO uplink system under the assumptions that the ADCs 
have 1-bit resolution, that the users transmit QPSK symbols, 
that the receiver performs maximum-ratio combining or zero 
forcing, and that pilot transmission followed by least-squares 
(LS) channel estimation at the BS is used to acquire channel- 
state information (CSI) at the receiver. Their result—albeit 
limited to QPSK—shows that massive MU-MIMO is resilient 
against coarse-quantization noise. The motivation for using LS 
channel estimation is that the maximum a-posteriori (MAP) 
estimator is claimed to have a computational complexity that 
scales exponentially in the number of active users. 

The analysis in Q has been extended recently to higher-order 
modulation schemes in Q. There, it was shown that amplitude 
information about the transmitted signal can be recovered even 
in the presence of a single-bit quantizer, provided that the 
number of receive antennas is sufficiently large and the signal- 
to-noise ratio (SNR) is not too high. The observation that 
noise can help recovering magnitude information under 1-bit 
quantization has also been made in the compressive-sensing 
literature 0, (H. 

Liang and Zhang LD considered a mixed-ADC massive MU- 
MIMO architecture, where few high-resolution ADCs are used 
together with many 1-bit ADCs. The high-resolution ADCs 
allow the receiver to acquire accurate channel-state information 
(CSI). Generalized mutual information was used to characterize 
the performance of this architecture for the case of Gaussian 
codebooks and mismatched nearest-neighbor decoding. In in)> 
it is also noted that dithering prior to quantization may help 
improving the performance. 

In summary, the results in Q, 0, Q, | |1H suggest that 
massive MU-MIMO provides resilience against noise caused 
by coarse quantization. However, all these results pertain to 
massive MU-MIMO systems operating over frequency-flat 


fading channels. Such channels are rarely encountered in 
modern broadband wireless communication systems, which 
instead experience a high degree of frequency selectivity. This 
paper investigates the impact of coarse ADC quantization 
on the error-rate performance in systems operating over 
frequency-selective channels, which is—as we believe—more 
relevant for next-generation wireless systems. Although a 
frequency-selective channel can be converted into a set of 
independent frequency-flat channels by means of OFDM, such 
an orthogonalization relies on the assumption that infinite- 
precision ADCs are used on the time-domain signals (see 
Section [Ill-A| for the details). 

Outside the massive-MIMO literature, the performance of 
communication systems employing low-precision ADCs has 
been previously analyzed, mainly in the context of ultra- 
wideband and millimeter-wave systems. The structure of the 
capacity-achieving input distribution for point-to-point AWGN 
channels with low-precision ADCs has been characterized 
in HI- An extension to fading channels in the limiting regime 
of 1-bit ADCs for the case of both no CSI and of CSI available 
only at the receiver was provided in |Tg-|Tg. The case 
of full CSI at both the transmitter and receiver has been 
analyzed recently in p^ . In the low-SNR regime, it was shown 
in m that a zero-threshold comparator is not optimal for 1- 
bit ADCs. Instead, the capacity-achieving strategy requires the 
combination of flash signaling p8] Def. 2] with a suitably- 
chosen quantization threshold. 

The optimization of the ADC levels in single-antenna 
communication systems operating over deterministic (i.e., non¬ 
fading) frequency-selective channels was tackled in | [10 using 
bit error rate as the performance metric, and in ]20[ using 
the mutual information. Sampling faster than the Nyquist 
rate may be beneficial when using low-precision ADCs (see, 
e.g., PH , p2| and P3| Sec. VI]). For example, it was 
shown in p4| that by oversampling at the receiver, one can 
reliably transmit 16-QAM symbols over a 1-bit-quantized 
single-antenna AWGN channel. 

The strong nonlinearity introduced by low-precision ADCs 
renders the tasks of MIMO channel estimation and data 
detection challenging. Such problems can be cast in the more 
general framework of parameter estimation in the presence 
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of quantized output measurements @-1^. The specific 
case of channel estimation has been discussed in | [3T| , where 
the maximum-likelihood (ML) estimate is computed via 
expectation maximization. For the case of 1-bit ADCs combined 
with time-multiplexed pilot transmission, it was shown recently 
that the ML estimate has a closed-form expression | |32) . Data 
detection algorithms have heed discussed, e.g., in P3|-j36|. All 
these works, however, deal exclusively with systems operating 
over frequency-flat (or narrowband) fading channels. 

C. Contributions 

We investigate the effects of coarse quantization on a massive 
MU-MIMO-OFDM uplink system operating over frequency- 
selective, wideband channels. We develop new channel- 
estimation and data-detection algorithms, and investigate their 
performance/complexity trade-offs. Our main contributions are 
summarized as follows: 

• We show that, under suitable assumptions on the statistics 
of the fading channel and of the noise, MAP channel 
estimation in the presence of (coarse) quantization is 
a convex problem that can be solved exactly using 
computationally efficient numerical methods. 

• We formulate the problem of MAP data detection in the 
presence of quantization and develop a low-complexity 
minimum-mean square error (MMSE) data-detection al¬ 
gorithm. 

• We develop channel-estimation and data-detection algo¬ 
rithms for two mismatched quantization models, which 
trade error-rate performance for computational complexity. 
This trade-off is investigated through a comprehensive set 
of numerical simulations. 

Our results demonstrate that massive MIMO enables the use of 
coarse quantization at the BS, without a signiflcant performance 
loss and without an increase in computational complexity com¬ 
pared to the case of systems with infinite-precision quantizers. 
Although we focus on the massive MU-MIMO-OFDM uplink, 
our algorithms also apply to traditional, small-scale MIMO- 
OFDM or even single-antenna OFDM systems. 


proposed channel-estimation and data-detection algorithms for 
a simplified single-antenna OFDM system. Section |IV] deals 
with the extension of the proposed algorithms to the full- 
fledged quantized MU-MIMO-OFDM system illustrated in 
Fig. 0 Section |V] summarizes the numerical methods used to 
solve the convex optimization problems for channel estimation 
and data detection. Section [Vl] provides numerical simulation 
results. We conclude in Section IVlIl 

IT Exact and Mismatched Quantization Models 

We now introduce the quantization model considered 
throughout the paper. We also propose a mismatched quan¬ 
tization model that enables suboptimal receiver algorithms 
requiring lower computational complexity. 

A. Quantization of Complex-Valued Data 

The in-phase and quadrature components of an RF-chain’s 
outputs are typically converted to the digital domain using 
a pair of ADCs. After a sample-and-hold stage, the analog 
time-domain sample s G C is mapped onto a finite-cardinality 
quantization alphabet according to q — Q(s), where Q( ) : 
C —>^ is a complex-valued scalar quantizer and Ac is the 
complex-valued quantization alphabet. In what follows, we 
focus on scalar quantizers that operate independently on the 
in-phase (real) and on the quadrature (imaginary) part of each 
sample. With a slight abuse of notation, we frequently apply 
the quantizer to a vector/matrix, with the understanding that 
quantization is applied entry-wise to the vector/matrix. 

The real part and the imaginary part of each complex 
sample s are mapped onto a label from the quantization set 
A — Q} (the same for both real and imaginary part) 

of cardinality Q = \A\. Hence, we have that Ac = A x A. 
Each label describes the quantization bin in which and 
fall into. Consequently, the quantizer output Q{s) associated 
to a complex-valued sample s = -f is^ consists of a pair 
G A X A, or, equivalently, of a complex number 
q = q^ iq^ ■ The labels are determined by comparing and 
with Q -f 1 quantization bin boundaries {bg}: 


D. Notation 


-oo = bi < b 2 < ■ ■ ■ < bq < bq+i = -boo. 


Lowercase and uppercase boldface letters designate column 
vectors and matrices, respectively. For a matrix A, we indicate 
its conjugate transpose by . The fth column of the matrix 
A is denoted by a^, the entry on the fcth row and on the fth 
column is A^^e, and the /cth entry of a vector a is [a]fc = a^. 
The M X M identity matrix is denoted by 1m and the 
M X N all-zeros matrix by Omxn- The real and imaginary 
parts of a complex scalar a are and , respectively. The 
multivariate complex-valued circularly-symmetric Gaussian 
probability density function with covariance matrix K is 
denoted by CA/^(0,K). We use p(-) to denote generic proba¬ 
bility mass functions, whereas /(•) is reserved for probability 
density functions. 


E. Paper Outline 

The rest of the paper is organized as follows. Section 
introduces the quantization model. Section III describes the 


Specifically, if bm <s^< bjn+i, m = 1,... ,Q, then q^ = m. 
The imaginary part of the quantizer output q^ is obtained 
by applying the same rule to . We use Qi, = log 2 |A| to 
designate the number of quantization bits per real dimension. 
The total number of bits per complex-valued sample is 2Qt,. 

In what follows, we shall focus on the scenario where the 
thermal noise in the system (which appears before the quantizer) 
is a stationary memoryless circularly symmetric complex 
Gaussian process, whose samples have variance = Nq/2 
per real and imaginary part. With this assumption, we obtain 
the following nonlinear input-output relation for the quantizer: 

q= Q{z-i-n). (1) 

Here, z G C is the noiseless, unquantized (time-domain) sample 
and n ~ CAf{0, Nq) is the thermal (receive) noise that is added 
to the signal of interest prior to quantization. Throughout the 
paper, we make frequent use of the likelihood p{q | z) of a 
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quantization label q & Ac given a noiseless unquantized time- 
domain sample z. To compute this quantity, we use the facts 
that (i) the real and the imaginary parts of the thermal noise 
n are independent and (ii) the complex-valued quantizer Q( ) 
operates independently on the real and the imaginary part. 
These two properties imply that 

p{q\z = + iz^) = p{q^ \ z^)p{q^ \ z^). (2) 


We can now compute the probability of observing the real 
part q^ of the quantization label, given the real part z^ of the 
noiseless unquantized sample as follows 1^, 0, 0: 


p(g« I z«) = p{i{q^) < z^ + < u{q^)) 

(z^-vf 


= $ 


cuig^ 

h{qR) 

' U 


72 


exp - 


(g«) - 


TTCr^ 


- $ 


2a2 


Av 


(3) 


Here, $(a) = /“^(l/7^) exp(-i^2/2)di/ is the cumulative 
distribution function of a (real-valued) standard normal random 
variable; u{q^) = and i{q^) = b(^qR) are the upper 

and the lower bin boundary positions associated with the 
quantized measurement q^ G A, respectively. The probability 
of observing the imaginary part q^ of the quantization label, 
given z^ 


is obtained analogously as 


p(7 I z7 = $ 


i{q^) - 


- $ 


77)- 


(4) 


B. First Mismatched Quantization Model 


The above (exact) quantization model often leads to com¬ 
putationally expensive numerical algorithms that require high 
arithmetic precision. We therefore also consider the case where 
the receiver algorithms are designed on the basis of mismatched 
quantization models; such models enable the development of 
simpler and (often) faster algorithms at the cost of a small 
performance loss. By “mismatched quantization model,” we 
mean that algorithms for channel estimation and data detection 
are developed on the basis of a quantized input-oi^ut relation 
that does not match the exact one given in QQ Obviously, 
such a mismatch yields an error-rate performance loss. 

To describe our hrst mismatched quantization model (which 
we abbreviate as “Mismatch 1”), it is convenient to assign to 
each quantization label q a complex value y{q) whose real 
and imaginary parts lie within the quantization bin boundaries 
S{q) = [£{q^), u{q^)) X [£{q^),u{q^)) associated to q. Let 
f{v) denote the probability density function of the unquantized 
noisy signal v = z + n. Following |28) , we take y{q) as the 
centroid of S{q) under f{v), i.e.. 


^ Is(q) 

” /s(9)/7)d 


Now we can write 


y{q) = z + n + e 


( 6 ) 


'The term “mismatched” is commonly used in the information-theoretic 
literature to denote the scenario where the decoder operates according to a 
specific, possibly suboptimal, rule; see e.g., |38| and references therein. 


for some quantization error e that depends on the unquantized 
signal z -I- n. Note that, so far, we have just provided an 
alternative description of Q- We now obtain a mismatched 
quantization model by assuming that the quantization error 
e is independent of both z and n. Furthermore, we shall 
approximate its distribution by a circularly symmetric complex 
Gaussian distribution with zero mean and variance 


777 


h(q) 

!s(q) 


(7) 


which depends on the quantization label q. This approximation 
has been suggested in ]28| in the context of sparse signal 
recovery from noisy, quantized measurements. 

In this first mismatched quantization model, the likelihood 
p{q I z) takes the following form (cf. (|^ for the exact model); 


p{q I z) 


1 

7 r(iVo -f 72(g)) 


\z-y{q)? \ 

^0 + 777 / 


( 8 ) 


To compute and 0, one can either approximate the 
probability density function f{v) with a uniform distribution 
over the set S{q), or—if known—use the distribution of the 
unquantized signal v = z + n. 

In summary, the first mismatched quantization model is 
obtained by assuming (i) that the quantization error does not 
depend on the signal z or the noise n, and (ii) that the quantiza¬ 
tion error is Gaussian. We shall see that despite the crudeness of 
these approximations, the first mismatched quantization model 
enables computationally efficient channel-estimation and data- 
detection algorithms, which closely approach the error-rate 
performance of (more complex) algorithms developed for the 
exact quantization model detailed in Section |II-A| 


III. Quantized Single-Input Single-Output OFDM 

We now explain the principles of the channel-estimation and 
data-detection algorithms proposed in this paper. In order to 
avoid that the key features of our algorithms be obfuscated 
by the unavoidably intricate notation needed to describe a 
quantized massive MU-MIMO-OFDM uplink system, we first 
focus on a simple, quantized single-input single-output (SISO) 
OFDM system. The full-fledged quantized massive MU-MIMO- 


OFDM uplink system will be discussed in Section IV 


A. Quantized SISO-OFDM System Model 

Let the set O contain the points of the chosen digital modu¬ 
lation format (e.g., QAM or PSK). Assume that the transmitter 
converts a kF-tone frequency-domain signal s G into the 
time domain using an inverse discrete Fourier transform (DFT) 
according to v = F^s. Here, F denotes the W x W DFT 
matrix that satisfies F^F = I. After prepending the cyclic 
prefix, the signal is transmitted over a frequency-selective 
channel. The receiver quantizes the baseband time-domain 
signal and discards the cyclic prefix. The resulting time-domain 
input-output relation is given by 

q=Q(HF^s + n). (9) 

Here, the vector q G A^ contains the quantization labels, 
which are obtained by applying the quantizer Q( ) element¬ 
wise to the vector HF^s + n. The W x W channel matrix 
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H is circulant and contains the samples of the channel’s 
impulse response. The vector n G has independent and 
identically distributed (i.i.d.) circularly symmetric complex 
Gaussian entries with variance Nq. 

Since circulant matrices are diagonalized by the Fourier 
transform, we can rewrite (|^ as 

q = Q(F^diag(h)s + n) (10) 

where diag(h) = FHF^ stands for a diagonal matrix con¬ 
taining the frequency-domain representation h of the channel 
impulse response on its main diagonal. Unfortunately, the 
nonlinear nature of the quantization operator Q( ) implies 
that it is not possible to convert the time-domain input-output 
relation ( fTOl i into a diagonalized frequency-domain input-output 
relation, simply by computing a DFT at the receiver. 


B. MAP Channel Estimation 


We now show that pilot-based MAP channel estimation of the 
channel h in ( [TOl l can be formulated as a convex optimization 
problem. Specihcally, we assume a training phase consisting 
of a singl^OFDM symbol, which we model as follows: 

q=Q(Th-fn). (11) 

Here, T = F^diag(t) where the vector t S C'^ contains the 
pilot symbols (which are known to the receiver). To obtain ( [TT] ) 
from ( [Tol l, we set s = t and used that diag(h)t = diag(t)h. 
By assuming prior knowledge of the probability density 
function /(h) of the channel h, we can formulate the following 
MAP channel-estimation problem 

h^AP = arg max p(q | Th)/(h). (12) 

heCP' 


We now show that, under mild assumptions on the probability 


density function /(h), the optimization problem (12i can be 


solved exactly via convex optimization, despite the nonlinearity 
introduced by the quantizer Q(-). To this end, we hrst take the 
negative logarithm of the objective function on the right-hand 
side (RHS) of ([^, which allows us to rewrite •EH) in the 
following equivalent form: 


h^AP = arg min \ - logp(q | Th) - log /(h) 

hGC*v 


(13) 


It is now key to realize that the hrst term — logp(q | Th) on 
the RHS of ( [T3| ) is convex in h. To show this, we require the 
following result. 

Theorem 1: The negative log-likelihood — \ogp{q | z) in (j^ 
is a smooth, convex function in z G C. 

Proof: We start by noting that 

- \ogp{q \z) = - \ogp{q^ I z^) - \ogp{q^ \ z^). (14) 


To prove the theorem, we shall show that both summands on 
the RHS of ([^ are smooth convex functions in the variables 


^The number of degrees of freedom of frequency-selective channels is 
typically much smaller than the number of OFDM tones. This property ensures 
that a single training symbol is sufficient to acquire accurate channel estimates, 
even in the presence of coarse quantization. Nevertheless, our algorithms can 
easily be extended to systems that use more than one training symbol. 


and . Following the line of reasoning in |28 Sec. III-A], 
we note that the function 


pu-c 1 

^{u — c) — — c) = / _ exp(—i/^/2)dz/ 

JE—c V StT 

which appears in the expressions for p{q^ \ z^) and p{q^ \ z^) 
in 0 and 0, respectively, is log-concave in the real variable c, 
because it is a convolution between two log-concave functions: 
the Gaussian pdf and the indicator function of the interval 
[£,«]. This concludes the proof of the theorem. ■ 

Since Th is linear in h. Theorem [^implies that the negative 
log-likelihood - logp(q | Th) = J2w=i “ logP(9» I [Th]«,) 
is convex in h. Hence, the MAP channel-estimation problem 
( |13| l is convex as long as the negative logarithm of the prior 
term — log/(h) is convex. This property is satisfied by any 
log-concave prior distribution /(h) on the channel vector h. 
For example, if the entries of h are i.i.d. circularly symmetric 
complex Gaussian with variance Cq (which corresponds to i.i.d. 
Rayleigh fading), we have — log/(h) = UhlU/Go+ log(7rC'o), 
which is a smooth, convex function in h. 

If we assume that no prior knowledge on the probability 
distribution of h is available, which is equivalent to letting 
Co —oo in our example, we transform the MAP channel- 
estimation problem into an ML channel-estimation problem, 
which remains convex. The crucial observation that MAP and 
ML channel estimation in the presence of the quantizer Q( ) 
can be formulated as convex problems implies that efficient 
numerical methods for channel estimation exist (see Section jV] 
for the details). 


C. MAP and MMSE Data Detection 


To perform data detection from quantized measurements, we 
hrst rewrite the input-output relation (lOi as q = Q(Hs -f n). 


where H = F^diag(h) and h stands for either the MAP or the 
ML channel estimate obtained solving the convex optimization 
problem described in Section |III-B| 

1) Hard-Output MAP Detection: By assuming that prior 
information on the probability mass function pis) of the 
transmit data vectors s G is available, one could—in 
principle—minimize the probability that the vector s is detected 
erroneously by solving the following MAP data detection 
problem 0 


= arg max p(q | Hs)p(s). 

sGO»' 


(15) 


In absence of prior information, one can transform ( [T5] l into an 
ML data detection problem by assuming equally-likely transmit 
vectors, i.e., that p{s) — Vs G . Since the matrix 

H is, in general, not diagonal, this optimization problem is 
at least as hard as the MAP (or ML) data-detection problem 
in conventional, unquantized MIMO systems with W spatial 
streams. Specihcally, solving exactly entails an exhaustive 
search whose complexity grows exponentially in the number 
of subcarriers W (see |39|, |40| for more details). Since W 


^To be precise, the data-detection problem in Gg is the MAP problem only 
under the assumption of perfect channel state information. Since we estimate 
the channel in our setup, this detector is actually mismatched. This observation 
holds for all detectors presented in the remainder of this section. 
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can be of the order of hundred or even thousand in modern 
OFDM systems, solving ( [T5] l directly is not feasible and one 
has to resort to approximate algorithms. 

2) Hard-Output and Soft-Output MMSE Detection: To arrive 
at a low-complexity data-detection method, we relax the hnite- 
alphabet constraint s G so that the resulting data-detection 
problem becomes convex and can be solved efficiently. We 
make the common approximation that the entries of s are 
i.i.d. circularly symmetric complex Gaussian random variables 
with variance (where Eg stands for the average energy per 
transmitted symbol); this allows us to formulate a quantized 
version of the well-known MMSE equalizer: 

s = arg mini-logp(q I Hs) -I- (16) 
sGCrr I Eg } 

As a consequence of Theorem this optimization problem is 
convex and can be solved efficiently. Its solution can either 
be mapped element-wise onto the closest constellation point 
as = arg min^gg |a — s^,|, tu = 1,..., W, to obtain a 

hard-output MMSE estimate or used to compute soft- 

information (for a given mapping between bits and constellation 
symbols) in the form of max-log log-likelihood ratio (LLR) 
values pT| , p2l : 

Ln,,b = Pw min |s^ - ap - min \s^ - a\^. (17) 

aGO<«> aeOi^^ 

Here, denotes the post-equalization signal-to-interference- 
and-noise ratio (SINR) on the wth subcarrier, and and 
refer to the subsets of O consisting of all constellation 
symbols whose 6th bit is 0 and 1, respectively]^ We note 
that solving the convex problem yields, in general, 
no information on the SINR pw, which makes an exact 
computation of ([T^ impossible. Throughout the paper, we 
approximate ( [l7] l by setting p^, = 1, unless stated otherwise. 
We note that this approximation was shown in |43| , ]44) to 
result in near-optimal performance for the case of massive 
MIMO data detection with max-log channel decoding. 


D. Channel Estimation and Data Detection with the First 
Mismatched Quantization Model 

We now turn our attention to the hrst mismatched quantiza¬ 
tion model introduced in Section |II-B| and derive the optimal 
channel estimator and data detector for the case in which the 
receiver is designed on the basis of this mismatched model. 
With this mismatched quantization model, the input-output 
relation ( [TOl i is replaced by 

y(q) = F^diag(h)s-f n-f e. (18) 

In ([^, which is the OEDM generalization of the scalar input- 
output relation (j^, the noise vector follows a CA/^(0, iVoIw) 
distribution, and the entries of the quantization noise 

vector e are distributed according to CJV{0,j'^{qw))- The 
variance where q^j denotes the quantization label 

associated to the received signal on the w subcarrier, is dehned 

“^The subsets and depend on the mapping from constellation 

symbols to bits (e.g., a Gray mapping). 


similarly to 0- Proceeding as in the case of unquantized SISO- 
OFDM systems, we apply a DFT to the time-domain vector 
y(q) and obtain the following frequency-domain input-output 
relation: 


y = diag(h)s-f h-f e. (19) 


Here, y = Fy(q), h ~ CJ\f(0, NqIw), and e ~ CA/’(0, Kg) 
where Ke = FFF^ is the covariance matrix of the quanti¬ 
zation noise e, expressed in the frequency domain. Finally, 
r = diag(j^(qi),... ,j^(qw)). We note that the entries of 
the vector e are, in general, correlated. As we shall discuss 
next, this has a negative impact on the complexity of MAP 
channel-estimation and data-detection algorithms. 

i) MAP Channel Estimation: Analogously to P3| l, MAP 
channel estimation for the mismatched quantization model ( [T9l l 
can be formulated as follows: 


j^MAP _ 


= arg mm 
hGC’*' 


{||K i/2('^_(jiag(t)h)||2 -logp(h)|. 


( 20 ) 


Here, K = TVoIpu + Ke stands for the covariance matrix of the 
Gaussian additive noise plus the frequency-domain quantization 
noise, and t G C'^ denotes the vector containing the pilot 
symbols used for channel estimation. If p(h) is log-concave, the 


problem (201 is also convex and can be solved efficiently using 
numerical methods. In the case where p(h) is Gaussian, 
reduces to a IF-dimensional LS problem. 

2) MAP and MMSE Data Detection: Similarly to the MAP 
channel-estimation problem ( [20) l, we can formulate the MAP 
data-detection problem as follows: 


-MAP 

s = arg mm 

sGOrr 


|K i/2(y-diag(h)s)||2-logp(s) 


( 21 ) 


Here, h is the MAP channel estimate, obtained by solving 
Since, as aheady mentioned, K is in general not diagonal, 
solving pT) enta ils prohibitive complexity (see the discussion 
in Section III-C[ l. 

To reduce the computational complexity of the MAP data- 
detection problem, we relax the alphabet constraint in ( |21| l 
by proceeding as in Section III-C and obtain the following 
mismatched version of the quantized MMSE equalizer in ([T^: 


gMMSE _ 

sGC”" 


|K-i/2(y_diag(h)s) 


!^ + 


Eg 


( 22 ) 


The optimization problem in ( |22l l is a PE-dimensional LS 
problem and its solution can be used to perform hard-output 
or soft-output detection, as described in Section [Ill-C| 


E. Second Mismatched Quantization Model 

We now provide a second mismatched quantization model 
(abbreviated as “Mismatch 2”), which yields a signihcant 
complexity reduction for channel estimation and data detection 
(cf. Section |III-D| l. The key idea is to replace the noise 
vector e in (|18[l, whose entries are independent but not 
identically distributed, with a vector e with i.i.d. entries. 
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Specifically, we let the entries of e be CA/^(0, 7 )-distributed 
and set 7 = ^ which is the average variance 

of the entries of the vector e. This approach results in the 
following mismatched quantization model; 

y(q) = F^diag(h)s + n + e. (23) 

The advantage of this second mismatched quantization model 
is as follows; when we apply the DFT operator to y(q), the 
total noise remains uncorrelated because the correlation matrix 
of n + e is a scaled identity (cf. ([T^). This means that the 
MAP channel-estimation and data-detection problems decouple 
into W independent one-dimensional problems that can be 
solved at low computational complexity]^ In particular, one 
can deploy standard OFDM signal-processing techniques that 
operate tone-wise in the frequency domain. An immediate 
benefit of this mismatched quantization model is that we can 
easily extract the post-equalization SINR values which are 
required for LLR computation in ([T7|. 


IV. Quantized Massive MU-MIMO-OFDM Uplink 


We are now ready to present our quantized massive MU- 
MIMO-OFDM uplink scheme in full detail. Although the 
combination of MU-MIMO and OFDM renders the notation 
rather involved, the main ideas of our approach follow those 
discussed in Section m for the quantized SISO-OFDM case. 
We start by introducing the system model. We then derive 
the MAP channel-estimation and MMSE data-detection algo¬ 
rithms for the exact quantization model. Finally, we propose 
low-complexity algorithm variants based on the mismatched 


quantization models introduced in Sections II-B and Ill-E 


A. Quantized MIMO-OFDM System Model 

We consider a coded MU-MIMO-OFDM uplink system that 
employs spatial multiplexing. The system model is depicted 
in Fig. and consists of U independent single-antenna user^ 
communicating with a BS having B > U receive antennas. 
We consider a frame-based OFDM transmission, where each 
OFDM symbol consists of W tones, similar to what is 
used in IEEE 802.1 In m- We furthermore assume that 
communication is effected in two phases; (i) a training phase 
consisting of U OFDM symbols (which enables the training 
of every channel coefficient at least once) followed by (ii) 
a data transmission phase consisting of D OFDM symbols. 
The disjoint sets Udata, f^piiot, and Uguard contain the indices 
associated to data tones, pilot tones, and guard (or zero) tones, 
respectively. Clearly, |Udata| + l^^piiotl + l^^guardl = W. For the 
sake of simplicity, we assume a sufficiently long cyclic prefix, 
perfect synchronization, and ignore hardware impairments such 
as phase noise, I/Q imbalance, and amplifier nonlinearities (see, 
e.g., 0, pb) , | |47| for more details on hardware impairments 
in OFDM systems). 

^This holds provided that the priors are product distrihutions, i.e., that 
/(h) = nSLi /(^™) and that p(s) = OjLl ?(»»)■ 

®Our framework enables also the use of multi-antenna user equipments. In 
this case, forward error correction and channel decoding can he performed 
over the multiple antennas. 


1) User Terminal (Transmit Side): During the training phase, 
the data and the pilot tones contain QPSK training symbols that 
are known to the BS; the guard tones remain unused. During 
the data-transmission phase, each user encodes its information 
bits using a forward error correction (FEC) channel code and 
maps the coded bits onto constellation symbols & O for 
w G Udata to be transmitted over the data-carrying tones. The 
pilot tones contain BPSK symbols; in a real system, they can 
be used to compensate for residual phase noise or frequency 
offset pT) . The guard tones are still left unused. Eor a given 
OFDM symbol, the frequency-domain transmit vector of each 
nser terminal u, which is compactly described by the vector 
s„ G is converted into the time domain using a VU-point 
inverse DFT according to v„ = F^s^. After prepending a 
cyclic prefix of length P, the time-domain signals v„ of all 
users u = 1,... ,U are transmitted simultaneously and in the 
same frequency band. 


2) Base Station (Receive Side): Each of the B antennas 

at the BS receive a noisy mixture of time-domain user 
data. Specifically, let G denote the unquantized 

time-domain received vector at antenna b, after removal of 
the cyclic prefix. The time-domain received vector at each 
antenna b = 1,... ,B is qnantized according to q?, = Q(yb), 
and then passed to the channel-estimation/data-detection unit. 
During the training phase, the channel-estimation unit generates 
estimates H^, G u> = 1,..., lU, for all OFDM tones. 

During the data-transmission phase, the data-detection unit 
generates LLR values for each coded bit of each user, on the 
basis of the channel estimates and of the quantized received 
vectors. The resulting (approximate) soft-information is nsed 
to perform decoding on a per-user basis. 

3) Channel model: To simplify the notation, we make 
frequent use of the mapping | |48[ 

= {Xaf=i (24) 

between matrices X^, G w = in per- 

frequency orientation and matrices XJ, 6 = 

in per-BS-antenna orientation. Here, T = U + D is the total 
number of OEDM symbols (during the training and the data- 
transmission phases). Eormally, the mapping ( |24| l is defined 
as [X.u,]b 7 = [X'f,]u, 7 , where w = b = 1,...,B, 

and i = 1,... ,T are the frequency (or time) index, the 
BS-antenna index, and OEDM-symbol index, respectively. To 
illustrate the effect of the mapping T in ( |24l i, consider the 
transmission of a single OFDM symbol (T = 1) consisting 
of W snbcarriers to a BS eqnipped with B antennas. We 
can either represent the overall received signal as W vectors 
Xw G O^, each containing B symbols, or, equivalently, as B 
vectors xj, G O^, each containing W symbols. The operator T 
allows us to conveniently switch from one representation to 
the other. A pictorial representation of this mapping is shown 
in Fig. ig 

Using the mapping T in ( [24) i together with the single-antenna 
OFDM input-output relatiori|lO|), we can write the input-output 
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per-frequency 

orientation 


per-BS-antenna 

orientation 




Fig. 2. Mapping T between per-frequency orientation (left) and per-BS- 
antenna orientation (right). The data cube (frequency tones x BS antennas x 
OFDM symbols) remains the same but data is associated differently to the 
matrices X and X'. 


relation of the quantized MU-MIMO-OFDM system as follows: 

{Zh}ti = r{H^S^}]ti (25) 

Q, = Q(F^Zfc + Nb), b=l,...,B. 

Here, the inputs of the channel are the W frequency-domain 
matrices S^, G each containing the data symbols, the 

pilots, and the guard tones for T OFDM symbols and U 
users. The outputs of the channel are the B quantized time- 
domain matrices Q{, G (one for each BS antenna), 

containing a VF-dimensional time-domain signal for each 
of the T OFDM symbols. The frequency-domain channel 
matrices H^j G in ( |25] l, which are assumed to remain 

constant over the T OFDM symbols, can be obtained from the 
time-domain matrices Hj G t = 1,..., W, using the 

Fourier transform = F defined as follows 

m- 


about the channel is that its impulse response does not exceed 
the cyclic prefix length P. This results in the following convex 
optimization problem: 


(Q-CHE) < 


_ minimize 
subject to 


B 

-y]logp(Qb|F^Zfe) 

{'Mti = r{H„T^}r=i 

Ht = Osxc/, t = P + . ,W. 


Here, Q-CHE stands for quantized channel estimation. The 
solution of this convex optimization problem is the set of 
estimates of the MU-MIMO-OFDM channel in 

the frequency domain. The ingredients of the optimization 
problem (Q-CHE) are as follows. The objective function, i.e., 
the negative log-likelihood of the quantization labels given the 
noiseless received signal is the MIMO generalization of the 
objective function given in ( [T3] l for the SISO-OFDM casej^The 
first constraint simply transforms a per-frequency representation 
into a per-BS-antenna representation as illustrated in Fig. 
this simplifies the form of the objective function. The second 
constraint transforms the channel matrices from the frequency 
domain to the time domain. The third constraint captures that 
the channel’s impulse response has at most P nonzero taps. 
Similarly to the SISO-OFDM case, in (Q-CHE) the MAP 
channel estimation must be performed jointly over all W 
OFDM tones. Here, however, we additionally include the 
constraint that the channel’s delay spread does not exceed 
P taps. This enables us to acquire accurate channel estimates 
using only few OFDM pilot symbols. Because of the convexity, 
(Q-CHE) can be solved exactly and efficiently using first-order 
methods (see Section [V] for the details). 


H,„ = 


1 f O {t-l){w-l-W/2)\ 

-j. 


(26) 


Here, the time-domain matrices of dimension B xU, 

contain the impulse responses from each user to each BS 
antenna. In what follows, we assume that the impulse responses 
are supported on at most L < W taps, i.e., = Og^u for 

t = L-|-l,...,VF;we furthermore assume that the cyclic-prefix 
length P satisfies L < P. 


B. MAP Channel Estimation 

We are now ready to formulate MAP channel estimation as 
a convex optimization problem. As outlined in Section we 
seek estimates H^j of the channel matrices H^, w = 1,..., W, 
given the quantized measurements Qb, b — 1,...,B, and 
the known, orthogonal pilot matrices Tu, G where 

w G Hpiiot U fldata- Specifically, our objective is to minimize 
the negative log-likelihood of the quantized data, given prior 
information on the channel matrices to be estimated. Since it is 
difficult in practice to acquire accurate prior channel statistics]^ 
we assume that the only information known to the receiver 

^Especially in MU cellular scenarios, where users may experience vastly 
different path losses. 


C. MMSE Data Detection 

Next, we shall extend the MMSE detector ( [Tbl l to the MU- 
MIMO case. The key difference with respect to the SISO- 
OFDM case is that the resulting convex-optimization problem 
involves not only all subcarriers, but also all antennas. During 
the data-transmission phase, the following convex-optimization 
problem needs to be solved for each OFDM symbol: 


minimize 

Stij ,TOGr2data 


(Q-DET) < 


subject to 


B 

-^logp(qb |F^Zb) 
b^l 

+ ^liSiu|l2 

JUGOdala 

G Dpdot. 


Here, Q-DET stands for quantized detection. The solution of the 
optimization problem (Q-DET) are the estimates s^, G Udata of 
the transmitted data symbols; these estimates can then be used 
to compute hard or soft estimates as described in Section [TlI-CI 
The ingredients of this optimization problem are as follows. 


®Note that, differently from m the term corresponding to the prior 
distribution of the channel is missing from the objective function of (Q- 
CHE). This is because a priori information about the channel distribution is 
assumed unavailable at the receiver. 
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The objective function is the MIMO generalization of ( [T6| ) 
for the SISO-OFDM case. Specifically, we optimize jointly 
over the data vectors s^,, w G fldata^ transmitted by the U 
users, and we account for the fact that the base station is 
equipped with B receive antennas. The first constraint in (Q- 
DET) transforms a per-frequency representation into a per- 
BS-antenna representation as illustrated in Fig. The second 
constraint captures that the symbols t^, w G fipiiotj transmitted 
on the pilot tones are known to the receiver (see Section IV-A i. 
Similarly to the MAP channel estimation problem (Q-CHE), 
the problem (Q-DET) can be solved exactly and efficiently 
using first-order methods (see Section [V| for the details). 


which can be done using efficient algorithms. Eurthermore, 
the SINR values that are necessary for soft-output data 
detection ( |17| l can be obtained in this case at virtually no 
cost; see ]41| , | [42| , |511 for more details on unquantized (i.e., 
infinite-precision) soft-output MMSE data detection. 


V. Efficient Numerical Methods 

The problems (Q-CFIE), (Q-DET), and (MQ-CHE) require 
us to solve large-dimensional convex optimization problems. 
We now summarize a class of algorithms that can be used to 
solve these problems at low complexity. We refer the interested 
reader to ||52) for more details on these algorithms. 


D. Mismatched Quantization Models 

1) First Mismatched Quantization Model: We now briefly 
discuss the MAP channel-estimation and data-detection prob¬ 
lems for the first mismatched quantization model introduced in 
Section |II-B| In essence, we only replace the conditional prob¬ 
abilities p(j-) in (Q-CHE) and (Q-DET) with the mismatched 
conditional probability p(j-) in ([^. While the dimensionality 
of the resulting problems remains unchanged, the algorithms 
that use the mismatched quantizer exhibit improved numerical 
stability and (often) faster convergence rates at the cost of 
a small degradation in performance (see Section |V] for the 
details). 

2) Second Mismatched Quantization Model: For the second 


mismatched quantization model introduced in Section III-E 


the MAP channel estimation and the MMSE data detection 
problems simplify drastically. Specifically, after a DFT opera¬ 
tion at the receiver, the channel estimation problem decouples 
into U X B independent subproblems, each one involving the 
estimation of a PF-dimensional channel vector. Specifically, 
each channel estimation subproblem is of the form 


(MQ-CHE) 


'y ^ lijw ^uji 

in G { ^^pilot U ^^data } 

subject to [F^h]t = 0, t = P + 1, 


minimize 

hGCrr 


,w. 


Here, MQ-CHE stands for mismatched-quantization channel 
estimation. In the above problem, w G {flpiiot U HpUot}, are 
the frequency-domain entries of the matrices H„, w G {HpHotU 
f^piiot} that correspond to the uth user and the 6th BS antenna, 
which are estimated using the orthonormal pilot matrices 
Tu, (see Section |IV-B| l. Although (MQ-CHE) has a closed- 
form solution]^ we will use a first-order method to reduce 
both computational complexity and memory requirements (see 
Section |V|. In practice, one can also obtain approximate 
solutions to (MQ-CHE) using more efficient methods (see 


e.g., 1501). 


With the second mismatched quantization model, MMSE 
data detection can be performed independently per subcarrier, 
as it is the case for conventional unquantized MIMO-OFDM 
systems. Hence, MMSE data detection requires one to solve 
|f^data| independent B x U MMSE data detection problems. 


®The problem (MQ-CHE) can be solved by computing a projection matrix of 
dimension W X W, which involves inverting a matrix of the same dimension. 
Such a direct approach would not only result in excessive computational 
complexity but also consume large amounts of memory. 


A. Forward-Backward Splitting 

In many fields, including (but not limited to) image process¬ 
ing, machine learning, and spectral clustering, one is interested 
in solving high-dimensional convex optimization problems of 
the following form p3): 


minimize 6,(Ax) -f g(x.). (27) 

xec« 

Here, A G , the function h is convex and differentiable, 

and g is convex, but not necessarily smooth or bounded. The 
generality of the function g prevents the use of simple gradient- 
descent algorithms. Eor many problems of interest, however, 
the so-called proximal operator |53| 

proXg(z,r) = argmin{T 5 (x)-f i||x-z||^} (28) 

X 


can be computed efficiently. In this case, one can deploy a com¬ 
putationally efficient first-order method to solve ( |Z7] i, generally 
referred to as forward-backward splitting (FBS). Specifically, 
EBS performs the following iteration for k = 1,2,..., until 
convergence | |5^ : 

x(fc+i) = proXg(x('=) - A^V/i(Ax('=)), (29) 


EBS requires the computation of the proximal operator 
of the gradient V/i of the smooth function, the computation 
of matrix-vector products involving A and its adjoint A^, 
and multiplications by the step size > 0. To reduce 
the complexity of FBS, one should (i) exploit fast transforms 
instead of matrix-vector products (if possible) and (ii) select 
an appropriate step-size in every iteration k. While a 
number of fast FBS implementations have been described in 
the literature, the numerical results reported in Section VI use 


the fast adaptive shrinkage/thresholding algorithm (FASTA) 
proposed in 


B. Implementation Details 

We shall illustrate next that the optimization problems (Q- 
CHE), (Q-DET), and (MQ-CHE) described in Section are 
of the form given in ( |27| ). Indeed, in all these problems the 
objective function is smooth and convex. We can therefore 
identify these objective functions with h in ( |27l l. The (non¬ 
smooth) function g in (|27]i can be associated with the affine 
constraints listed in (Q-CHE), (Q-DET), and (MQ-CHE). To 
this end, let C be the feasible set defined by the affine constraints 
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of the optimization problems described in Section |IV-A| where 
A^c(x) is the characteristic function of C, which we define to be 
zero if X € C and infinity otherwise. We shall set g(x) = A'c(x), 
which is a convex (but not smooth or bounded) function. Next 
we detail how to compute the proximal operator ( |28] l and the 
gradient of h, which is needed in ( |29l l. 

1) Gradients: For the exact quantization model introduced 
in Section |II-A[ the gradient of the negative log-likelihood 
function — logp(q | a), for the case of real-valued a, is given 
by 


[Vh(a)], 


exp -- 


<l{qi)-ai) 

2cr2 


s/2 






$ 


' u{qi) — ai 




. (30) 


The complex-valued extension of ( |30l l, which is required in both 
(Q-CHE) and (Q-DET) can be readily obtained by computing 
the gradient separately for real and imaginary parts p7) . For 
the mismatched quantization model introduced in Section II-B[ 
the (complex-valued) gradient of — logp(q | a) is given by 


[Vh(a)], 


v[qi) - 

iVo/2 + 72(9,)’ 


(31) 


A comparison of ( [30l l and ( [3T| l reveals that gradient computation 
is easier and less computationally intensive for the mismatched 
quantization model. 

2} Proximal Operators: As we shall show next, the proximal 
operators associated with the convex-optimization problems (Q- 
CFIE), (Q-DET), and (MQ-CHE) can be evaluated efficiently. 
Specifically, the function g in (Q-CHE) corresponds to the affine 
constraint Ht = Obxc/> t = P + I,..., W. This implies that 
the proximal operator ( [28] l involves transforming the channel 
matrices from frequency domain to time domain, setting to zero 
all matrices with index t = P + 1,... ,W , and transforming 
the resulting matrices back to the frequency domain. By using 
the fast Fourier transform (FFT), the proximal operator can be 
evaluated efficiently. Similarly, (MQ-CHE) involves the affine 
constraint [F^h]t =0, t = P + 1,... ,W. The computation 
of the proximal operator requires transforming the frequency- 
domain vector h into time domain, setting to zero all entries 
with index t — P + 1, ..., W, and transforming the resulting 
vector back to frequency domain. Finally, (Q-DET) involves the 
affine constraints s^, = t^,, w G flpUot- The proximal operator 
simply requires one to set s^j equal to for all w G Hpiiot. 


VI. Simulation Results 

We will now present numerical simulation results for the 
proposed quantized massive MU-MIMO-OFDM uplink system. 
Due to space constraints, we shall focus on a selected set of 
system parameters 


A. Simulation Parameters 

All simulation results are based on an IEEE 802.IIn-Iike 
system | [45) with 40 MHz bandwidth (similarly to the OFDM 

’®Our simulation framework will be available for download from GitHub 
(https://github.eom/quantizedmassivemimo/mu_mimo_ofdm/l. The purpose is 
to enable interested readers to perform their own simulations with different 
system parameters and also to test alternative algorithms. 


system parameters considered in ||48]). The number of OFDM 
subcarriers is IF = 128; guard tones, pilot tones, and data 
tones are as defined in (45] . Each user encodes its information 
bits using a (rather weak) rate-5/6 convolutional code of 
constraint length 7 with random interleaving. Decoding at 
the base-station relies on a max-log soft-input Viterbi decoder. 
Channel training is performed as described in Section [TV-All In 
the data-transmission phase, the transmitted symbols belong to 
a 16-QAM constellation. The channel matrices are generated 
according to (26i, where the entries of the non-zero time- 
domain matrices Ht, f = 1,..., L, are assumed i.i.d. circularly 
complex Gaussian with unit variance. We assume that the 
number of channel taps is L = 4; the cyclic prefix length is 
P = 16. The pilot matrices T^, are generated from a U x U 
orthonormal Hadamard transform matrix, where the signs of 
the columns and rows are randomly alteredp^The quantization 
alphabet Ac and the quantization bin boundaries {bq} are 
set using the Lloyd-Max algorithm lig, i|^. For a given 
set of system parameters, the algorithm uses an estimate of 
the probability density function of the received signal, which 
is acquired offlinej^ We use the FASTA solver | |56) with 
tolerance set to 10^ and maximum number of iterations set 
to 500. The packet error rate (PER), which is averaged over 
all users, is obtained via Monte-Carlo simulations with 1000 
packet transmissions for each simulated pointj*^ 

To characterize the system performance for a given number 
of quantization bits Qb (see Section II-A| l, we compute the 
so-called “SNR operating point” proposed in (^, ||^, which 
is the minimum average receive SNR required to achieve 1% 
PER. We consider three receiver architectures. The first one, 
referred to as “Quantizer,” performs MAP channel estimation 
and MMSE data detection as described in Section IIV-BI and 
Section IV-C respectively. The second and third one, referred to 
as “Mismatch 1” and “Mismatch 2,” respectively, perform their 
computations as described in Section [TV-Dll and Section [rV-D2| 
respectively. 


B. Results and Discussion 

Fig. illustrates the trade-off between the SNR operating 
point and the number of quantization bits Qfa for different BS- 
antenna-to-user ratios. As a reference, we show the performance 
of an infinite-precision single-input multiple-output (SIMO) 
system both for the case of perfect CSI at the receiver (CSIR), 
and for the case of channel estimation (CHEST). To obtain 
these curves, we replaced the negative log-likelihood of the 
quantized channel output with the one of the channel output 
before quantization in the objective functions of the iterative 
algorithms described Section [IV| 

^^We observed that the choice of the pilot matrices has a noticeable impact 
on the error-rate performance in systems with coarse quantization. In particular, 
orthogonalizing the users in time during pilot transmission results in higher 
channel-estimation errors. A detailed investigation of this phenomenon, which 
is similar to the one described in in the context of modulation design for 
1-bit massive MIMO systems, is part of ongoing work. 

^^In practice, one can precompute the quantization bin boundaries and store 
them in a look-up table. 

*^The trade-off curves are obtained by performing 1000 Monte-Carlo trials 
for each SNR point. The irregular behavior of some of the curves is a 
consequence of the chosen number of trials and the SNR resolution. 
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Fig. 3. Trade-off between the SNR operating point and the number of quantization bits for 8 users and {16, 32, 64,128} base station antennas. Only a 
few bits are sufficient to approach the performance of the SIMO lower bound with infinite-precision ADCs. Furthermore, for systems with a large number of 
BS antennas (e.g., B = 128), no sophisticated signal processing is required in the presence of low-precision ADCs, i.e., the “Mismatch 2” receiver achieves 
near-optimal performance. 


Fig. [^a) compares the performance/quantization trade-off 
for the "Quantizer,” “Mismatch 1,” and “Mismatch 2” receiver 
architecmres for a system with 16 BS antennas and 8 users 
(corresponding to a BS-antenna-to-user ratio of 2). We see that 
“Quantizer” outperforms both “Mismatch 1” and “Mismatch 2” 
when 3 < Qb < 5. When Qb < 3, the system is not 
able to reach a PER of 1%. Interestingly, when Qb > 6, 
“Mismatch 2” outperforms both “Quantizer” and “Mismatch 1”; 
this is because in “Mismatch 2” it is possible to compute 
the post-equalization SINR values required for soft-output 
computation in ( [T7) i, whereas in the other receiver architectures, 
we simply set p^, = 1. 


Fig. [^b) shows that increasing the number of BS antennas 
to 32 (corresponding to a BS-antenna-to-user ratio of 4), 
enables “Quantizer” and “Mismatch 2” to achieve a similar 
SNR operating point when Qb > 5. This shows that the LLR 
approximation in Section |III-C| is accurate for such a BS- 
antenna-to-user ratio, conhrming the observations made in 
ED, In this regime, one should use the “Mismatch 2” 
receiver architecture (which effectively ignores the presence 
of a quantizer), as it does not entail a complexity increase 
compared to conventional algorithms for inhnite-precision MU- 
MIMO-OFDM systems. For lower values of Qb, however, 
“Quantizer” signihcantly outperforms “Mismatch 2” (we do 


not show “Mismatch 1” as it exhibits similar performance to 
“Quantizer”). Also note that with “Quantizer,” we are now able 
to achieve 1% PER for only two quantization bits (i.e., Qb = 2). 

Figs. I^c) and[^d) show that by further increasing the BS- 
antenna-to-user ratio to 8 and 16, respectively, “Mismatch 2” 
delivers the same performance as “Quantizer.” In addition, 
we see that for both receiver architectures, Qt = 4 yields an 
SNR gap to the inhnite-precision ADC case of only 0.25 dB. 
Furthermore, the SNR gap to the SIMO system (with channel 
estimation, but inhnite precision) is less than IdB. All these 
observations have far reaching consequences for quantized 
massive MU-MIMO-OFDM systems with BS-antenna-to-user 
ratio exceeding 8. In particular, we see that the simple and 
low-complexity “Mismatch 2” receiver architecture achieves 
near-optimal performance with only 4 bit precision at virtually 
no complexity increase in baseband processing compared to 
inhnite-precision MU-MIMO-OFDM systems. 

VII. Conclusions 

We have studied the performance of a quantized massive 
MU-MIMO-OFDM uplink operating over frequency-selective, 
wideband channels. We have developed new channel-estimation 
and data-detection algorithms, and studied the associated perfor¬ 
mance/quantization trade-offs. Our simulation results show that 
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the use of four-bit ADCs is sufficient to achieve near-optimal 
performance in massive MU-MIMO-OFDM systems having a 
BS-antenna-to-user ratio of eight or higher. Remarkably, this 
comes at no additional costs in terms of baseband-processing 
complexity, compared to infinite-precision systems. Our results 
imply that coarse quantization enables low-cost and low- 
power massive MU-MIMO-OFDM system implementations 
that achieve near-optimal performance. In addition, reducing 
the precision from 10 bit to 4bit yields 2.5 x lower ADC output 
rates, which is particularly relevant for massive MIMO systems 
where the antenna elements and baseband processing unit are 
spatially separated. 

There are many avenues for future work. The presence 
of a quantizer renders synchronization and noise-variance 
estimation a difficult task; the development of corresponding 
methods is of paramount importance to enable the practical 
deployment of quantized massive MU-MIMO-OFDM systems. 
Extracting post-equalization SINR values for LLR computa¬ 
tion directly via convex optimization may yield significant 
performance improvements; this is an open research problem. 
The application of the algorithms presented in this paper to 
single-carrier frequency-division multiple access (SC-FDMA) 
systems with coarse quantization, which is relevant to the 
3GPP LTE uplink ]59[ , is left for future work. One final 
open issue is whether the proposed quantized MU-MIMO- 
OEDM architecture is compatible with systems that use digital 
filters after the ADCs, as typically done in broadband wireless 
receivers to reduce costs and increase spectrum flexibility. 
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